DESCRIPTION

VOICE/MUSICAL SOUND ENCODING DEVICE AND VOICE/MUSICAL
SOUND ENCODING METHOD

Technical Field 

[0001] The present invention relates to a voice/musical 
tone coding apparatus and voice/musical tone coding 
method that perform voice/musical tone signal 
transmission in a packet communication system typified 
by Internet communication, a mobile communication system, 
or the like. 

Background Art 

[0002] When a voice signal is transmitted in a packet 
communication system typified by Internet communication, 
a mobile communication system, or the like, compression 
and coding technology is used to increase transmission 
efficiency. To date, many voice coding methods have been 
developed, and many of the low bit rate voice coding methods
developed in recent years have a scheme in which a voice 
signal is separated into spectrum information and 
detailed spectrum structure information, and compression 
and decoding is performed on the separated items. 

[0003] Also, with the ongoing development of voice 
telephony environments on the Internet as typified by 
IP telephony, there is a growing need for technologies
that efficiently compress and transfer voice signals.

[0004] In particular, various schemes relating to voice 
coding using human auditory masking characteristics are 
being studied. Auditory masking is the phenomenon 
whereby, when there is a strong signal component contained 
in a particular frequency, an adjacent frequency 
component cannot be heard, and this characteristic is 
used to improve quality. 

[0005] An example of a technology related to this is the 
method described in Patent Literature 1 that uses
auditory masking characteristics in vector quantization 
distance calculation. 

[0006] The voice coding method using auditory masking 
characteristics in Patent Literature 1 is a calculation 
method whereby, when a frequency component of an input 
signal and a code vector shown by a codebook are both 
in an auditory masking area, the distance in vector 
quantization is taken to be 0. 

Patent Literature 1: Japanese Patent Application Laid-Open
No. HEI 8-123490 (p. 3, FIG. 1)

Disclosure of Invention 

Problems to be Solved by the Invention 

[0007] However, the conventional method shown in Patent 
Literature 1 can only be adapted to cases with limited 
input signals and code vectors, and sound quality 
performance is inadequate.

[0008] The present invention has been implemented taking
into account the problems described above, and it is an 
object of the present invention to provide a high-quality 
voice/musical tone coding apparatus and voice/musical 
tone coding method that select a suitable code vector 
that minimizes degradation of a signal that has a large 
auditory effect. 

Means for Solving the Problems 

[0009] In order to solve the above problems, a
voice/musical tone coding apparatus of the present
invention has a configuration that includes: a quadrature
transformation processing section that converts a
voice/musical tone signal from time components to
frequency components; an auditory masking characteristic
value calculation section that finds an auditory masking
characteristic value from the aforementioned
voice/musical tone signal; and a vector quantization
section that performs vector quantization while changing,
based on the aforementioned auditory masking
characteristic value, the method of calculating the
distance between the aforementioned frequency component
and a code vector found from a preset codebook.

Advantageous Effect of the Invention 

[0010] According to the present invention, by performing
quantization while changing the method of calculating the
distance between an input signal and a code vector based
on an auditory masking characteristic value, it is
possible to select a suitable code vector that minimizes
degradation of a signal that has a large auditory effect,
improve input signal reproducibility, and obtain good
decoded voice.

Brief Description of Drawings 
[0011]

FIG. 1 is a block configuration diagram of an overall
system that includes a voice/musical tone coding 
apparatus and voice/musical tone decoding apparatus 
according to Embodiment 1 of the present invention; 

FIG. 2 is a block configuration diagram of a
voice/musical tone coding apparatus according to 
Embodiment 1 of the present invention; 

FIG. 3 is a block configuration diagram of an auditory
masking characteristic value calculation section 
according to Embodiment 1 of the present invention; 

FIG. 4 is a drawing showing a sample configuration 
of critical bandwidths according to Embodiment 1 of the 
present invention; 

FIG. 5 is a flowchart of a vector quantization section
according to Embodiment 1 of the present invention; 

FIG. 6 is a drawing explaining the relative 
positional relationship of auditory masking
characteristic values, coded values, and MDCT
coefficients according to Embodiment 1 of the present 
invention; 

FIG. 7 is a block configuration diagram of a
voice/musical tone decoding apparatus according to 
Embodiment 1 of the present invention; 

FIG. 8 is a block configuration diagram of a 
voice/musical tone coding apparatus and voice/musical 
tone decoding apparatus according to Embodiment 2 of the 
present invention; 

FIG. 9 is a schematic configuration diagram of a CELP 
type voice coding apparatus according to Embodiment 2 
of the present invention; 

FIG. 10 is a schematic configuration diagram of a 
CELP type voice decoding apparatus according to 
Embodiment 2 of the present invention; 

FIG. 11 is a block configuration diagram of an
enhancement layer coding section according to Embodiment 
2 of the present invention; 

FIG. 12 is a flowchart of a vector quantization 
section according to Embodiment 2 of the present 
invention; 

FIG. 13 is a drawing explaining the relative 
positional relationship of auditory masking 
characteristic values, coded values, and MDCT 
coefficients according to Embodiment 2 of the present 
invention;

FIG. 14 is a block configuration diagram of a decoding
section according to Embodiment 2 of the present
invention;

FIG. 15 is a block configuration diagram of a voice 
signal transmitting apparatus and voice signal receiving 
apparatus according to Embodiment 3 of the present 
invention; 

FIG. 16 is a flowchart of a coding section according
to Embodiment 1 of the present invention; and 

FIG. 17 is a flowchart of an auditory masking value 
calculation section according to Embodiment 1 of the 
present invention. 

Best Mode for Carrying Out the Invention 

[0012] Embodiments of the present invention will now be
described in detail with reference to the
accompanying drawings.

[0013] (Embodiment 1)

FIG. 1 is a block diagram showing the configuration
of an overall system that includes a voice/musical tone
coding apparatus and voice/musical tone decoding
apparatus according to Embodiment 1 of the present
invention.

[0014] This system is composed of voice/musical tone
coding apparatus 101 that codes an input signal,
transmission channel 103, and voice/musical tone decoding
apparatus 105 that decodes the coded information.

[0015] Transmission channel 103 may be a wireless LAN,
mobile terminal packet communication, Bluetooth, or
similar radio communication channel, or may be an ADSL,
FTTH, or similar cable communication channel.

[0016] Voice/musical tone coding apparatus 101 codes 
input signal 100, and outputs the result to transmission 
channel 103 as coded information 102. 

[0017] Voice/musical tone decoding apparatus 105
receives coded information 102 via transmission channel
103, performs decoding, and outputs the result as output
signal 106.

[0018] The configuration of voice/musical tone coding 
apparatus 101 will be described using the block diagram 
in FIG. 2. In FIG. 2, voice/musical tone coding apparatus 
101 is mainly composed of: quadrature transformation 
processing section 201 that converts input signal 100 
from time components to frequency components; auditory 
masking characteristic value calculation section 203 that 
calculates an auditory masking characteristic value from 
input signal 100; shape codebook 204 that shows the 
correspondence between an index and a normalized code 
vector; gain codebook 205 that relates to each normalized 
code vector of shape codebook 204 and shows its gain; 
and vector quantization section 202 that performs vector 
quantization of an input signal converted to the 
aforementioned frequency components using the 
aforementioned auditory masking characteristic value,
and the aforementioned shape codebook and gain codebook.

[0019] The operation of voice/musical tone coding
apparatus 101 will now be described in detail in accordance
with the procedure in the flowchart in FIG. 16.

[0020] First, input signal sampling processing will be
described. Voice/musical tone coding apparatus 101
divides input signal 100 into sections of N samples (where
N is a natural number), takes N samples as one frame,
and performs coding on a frame-by-frame basis. Here, input
signal 100 subject to coding will be represented as xn
(n = 0, ..., N-1), where n indicates that this is the (n+1)th
of the signal elements comprising the aforementioned
divided input signal.

[0021] Input signal xn 100 is input to quadrature
transformation processing section 201 and auditory
masking characteristic value calculation section 203.

[0022] Quadrature transformation processing section 201
has internal buffers bufn (n = 0, ..., N-1) for the
aforementioned signal elements, and initializes these
with 0 as the initial value by means of Equation (1).

[0023]

$buf_n = 0 \quad (n = 0, \ldots, N-1)$ [Equation 1]

[0024] Quadrature transformation processing (step
S1601) will now be described with regard to the calculation
procedure in quadrature transformation processing
section 201 and data output to an internal buffer.

[0025] Quadrature transformation processing section 201
performs a modified discrete cosine transform (MDCT) on
input signal xn 100, and finds MDCT coefficient Xk by means
of Equation (2).

[0026]

$(k = 0, \ldots, N-1)$ [Equation 2]

[0027] Here, k signifies the index of each sample in one
frame. Quadrature transformation processing section 201
finds xn', which is a vector linking input signal xn 100
and buffer bufn, by means of Equation (3).

[0028]

$x'_n = \begin{cases} buf_n & (n = 0, \ldots, N-1) \\ x_{n-N} & (n = N, \ldots, 2N-1) \end{cases}$ [Equation 3]

[0029] Quadrature transformation processing section 201
then updates buffer bufn by means of Equation (4).

[0030]

$buf_n = x_n \quad (n = 0, \ldots, N-1)$ [Equation 4]

[0031] Next, quadrature transformation processing 
section 201 outputs MDCT coefficient Xk to vector 
quantization section 202. 
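
The buffer handling in Equations (1), (3), and (4) can be sketched as follows. This is a minimal illustration, not the apparatus itself: the class name `FrameLinker` is hypothetical, and the ordering of the linked vector (buffered previous frame first, current frame second) is an assumption drawn from the description of the buffer. The MDCT of Equation (2) is not shown.

```python
from typing import List


class FrameLinker:
    """Holds the previous frame and links it to the current frame."""

    def __init__(self, frame_len: int) -> None:
        self.frame_len = frame_len
        # Equation (1): the internal buffer is initialized with 0.
        self.buf: List[float] = [0.0] * frame_len

    def link(self, frame: List[float]) -> List[float]:
        """Return the 2N-sample linked vector and update the buffer."""
        if len(frame) != self.frame_len:
            raise ValueError("frame must contain exactly N samples")
        # Equation (3), assumed order: buffered samples, then current samples.
        linked = self.buf + list(frame)
        # Equation (4): the current frame becomes the buffer for the next call.
        self.buf = list(frame)
        return linked


linker = FrameLinker(2)
first = linker.link([1.0, 2.0])
second = linker.link([3.0, 4.0])
```

On the first frame the linked vector starts with zeros; from the second frame onward it starts with the preceding frame, which gives the transform its overlap between consecutive frames.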

[0032] The configuration of auditory masking
characteristic value calculation section 203 in FIG. 2
will now be described using the block diagram in FIG. 3.

[0033] In FIG. 3, auditory masking characteristic value
calculation section 203 is composed of: Fourier transform
section 301 that performs Fourier transform processing
of an input signal; power spectrum calculation section
302 that calculates a power spectrum from the
aforementioned Fourier transformed input signal; minimum
audible threshold value calculation section 304 that
calculates a minimum audible threshold value from an input
signal; memory buffer 305 that buffers the aforementioned
calculated minimum audible threshold value; and auditory
masking value calculation section 303 that calculates
an auditory masking value from the aforementioned
calculated power spectrum and the aforementioned buffered
minimum audible threshold value.

[0034] Next, auditory masking characteristic value
calculation processing (step S1602) in auditory masking
characteristic value calculation section 203 configured
as described above will be explained using the flowchart
in FIG. 17.

[0035] The auditory masking characteristic value
calculation method is disclosed in a paper by J.
Johnston et al. (J. Johnston, "Estimation of perceptual
entropy using noise masking criteria", in Proc. ICASSP-88,
May 1988, pp. 2524-2527).

[0036] First, the operation of Fourier transform section 
301 will be described with regard to Fourier transform 
processing (step S1701). 

[0037] Fourier transform section 301 has input signal
xn 100 as input, and converts this to a frequency domain
signal Fk by means of Equation (5). Here, e is the natural
logarithm base, and k is the index of each sample in one
frame.



[0038]

$F_k = \sum_{n=0}^{N-1} x_n e^{-j 2 \pi k n / N} \quad (k = 0, \ldots, N-1)$ [Equation 5]



[0039] Fourier transform section 301 then outputs
obtained Fk to power spectrum calculation section 302.
[0040] Next, power spectrum calculation processing 
(step S1702) will be described. 

[0041] Power spectrum calculation section 302 has
frequency domain signal Fk output from Fourier transform
section 301 as input, and finds power spectrum Pk of Fk
by means of Equation (6). Here, k is the index of each
sample in one frame.

[0042]

$P_k = (F_k^{Re})^2 + (F_k^{Im})^2 \quad (k = 0, \ldots, N-1)$ [Equation 6]

[0043] In Equation (6), Fk^Re is the real part of frequency
domain signal Fk, and is found by power spectrum
calculation section 302 by means of Equation (7).

[0044]

[0045] Also, Fk^Im is the imaginary part of frequency
domain signal Fk, and is found by power spectrum
calculation section 302 by means of Equation (8).

[0046]

[0047] Power spectrum calculation section 302 then
outputs obtained power spectrum Pk to auditory masking
value calculation section 303.
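
Steps S1701 and S1702 can be sketched together as follows. This is an illustrative example only: it assumes an unnormalized discrete Fourier transform with a negative exponent, which is one common convention and may differ from the one used in Equation (5), and the function names are hypothetical. The power spectrum does not depend on the sign of the exponent.

```python
import cmath
from typing import List


def fourier_transform(x: List[float]) -> List[complex]:
    """Direct O(N^2) DFT of one frame (assumed convention, see lead-in)."""
    n_len = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * cmath.pi * k * n / n_len) for n in range(n_len))
        for k in range(n_len)
    ]


def power_spectrum(x: List[float]) -> List[float]:
    """Equation (6): squared real part plus squared imaginary part."""
    return [f.real ** 2 + f.imag ** 2 for f in fourier_transform(x)]


impulse_power = power_spectrum([1.0, 0.0, 0.0, 0.0])
constant_power = power_spectrum([1.0, 1.0, 1.0, 1.0])
```

An impulse spreads its power evenly over all indices, whereas a constant frame concentrates all of its power at k = 0.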

[0048] Next, minimum audible threshold value
calculation processing (step S1703) will be described.

[0049] Minimum audible threshold value calculation
section 304 finds minimum audible threshold value athk
in the first frame only by means of Equation (9).

[0050]

$ath_k = 3.64 (k/1000)^{-0.8} - 6.5 e^{-0.6 (k/1000 - 3.3)^2} + 10^{-3} (k/1000)^4 \quad (k = 0, \ldots, N-1)$ [Equation 9]

[0051] Next, memory buffer storage processing (step
S1704) will be described.

[0052] Minimum audible threshold value calculation
section 304 outputs minimum audible threshold value athk
to memory buffer 305. Memory buffer 305 outputs input
minimum audible threshold value athk to auditory masking
value calculation section 303. Minimum audible
threshold value athk is determined for each frequency
component based on human hearing, and a component equal
to or smaller than athk is not audible.
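
A minimal sketch of the minimum audible threshold calculation follows, assuming the widely used threshold-in-quiet approximation with coefficients 3.64, 6.5, and 10^-3 and taking index k directly as the argument. The function name is hypothetical, and returning infinity at k = 0 (where the first term diverges) is an assumption made here only so that the example runs.

```python
import math
from typing import List


def minimum_audible_threshold(num_bins: int) -> List[float]:
    """Threshold value for k = 0, ..., num_bins - 1."""
    ath = []
    for k in range(num_bins):
        if k == 0:
            # (k/1000)^-0.8 diverges at k = 0; treated as infinity (assumption).
            ath.append(math.inf)
            continue
        f = k / 1000.0
        ath.append(
            3.64 * f ** -0.8
            - 6.5 * math.exp(-0.6 * (f - 3.3) ** 2)
            + 1e-3 * f ** 4
        )
    return ath


ath_values = minimum_audible_threshold(4)
```

Because the values depend only on k, they can be computed once and buffered, which matches the description that the calculation is performed in the first frame only.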

[0053] Next, the operation of auditory masking value
calculation section 303 will be described with regard
to auditory masking value calculation processing (step
S1705).

[0054] Auditory masking value calculation section 303
has power spectrum Pk output from power spectrum
calculation section 302 as input, and divides power
spectrum Pk into m critical bandwidths. Here, a critical
bandwidth is a threshold bandwidth for which the amount
by which a pure tone of the center frequency is masked
does not increase even if band noise is increased. FIG. 4
shows a sample critical bandwidth configuration. In
FIG. 4, m is the total number of critical bandwidths, and
power spectrum Pk is divided into m critical bandwidths.
Also, i is the critical bandwidth index, and has a value
from 0 to m-1. Furthermore, bhi and bli are the minimum
frequency index and maximum frequency index of each
critical bandwidth i, respectively.

[0055] Next, auditory masking value calculation section
303 has power spectrum Pk output from power spectrum
calculation section 302 as input, and finds power spectrum
Bi calculated for each critical bandwidth by means of
Equation (10).

[0056]

$B_i = \sum_{k=bh_i}^{bl_i} P_k \quad (i = 0, \ldots, m-1)$ [Equation 10]


[0057] Auditory masking value calculation section 303
then finds spreading function SF(t) by means of Equation
(11). Spreading function SF(t) is used to calculate, for each
frequency component, the effect (simultaneous masking
effect) that the frequency component has on adjacent
frequencies.

[0058]

$SF(t) = 15.81139 + 7.5 (t + 0.474) - 17.5 \sqrt{1 + (t + 0.474)^2} \quad (t = 0, \ldots, N_t - 1)$ [Equation 11]
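
The spreading function of Equation (11) can be evaluated directly, as in the following sketch. The function name and the choice of four sample points are illustrative assumptions.

```python
import math


def spreading_function(t: float) -> float:
    """Spreading function SF(t) for band distance t."""
    return 15.81139 + 7.5 * (t + 0.474) - 17.5 * math.sqrt(1.0 + (t + 0.474) ** 2)


# Tabulate SF(t) for t = 0, ..., Nt - 1 with an assumed Nt of 4.
sf_table = [spreading_function(t) for t in range(4)]
```

The constants are chosen so that SF(0) is approximately 0, and the value falls as the distance t from the masking component grows.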


[0059] Here, Nt is a constant set beforehand within a
range that satisfies the condition in Equation (12).

[0060]

$0 < N_t < m$ [Equation 12]

[0061] Next, auditory masking value calculation section
303 finds constant Ci using power spectrum Bi and spreading
function SF(t) added for each critical bandwidth by means
of Equation (13).

[0062]

$C_i = \begin{cases} \sum_{t} B_i \cdot SF(t) & (i < N_t) \\ \sum_{t} B_i \cdot SF(t) & (N_t \le i \le N - N_t) \\ \sum_{t} B_i \cdot SF(t) & (i > N - N_t) \end{cases}$ [Equation 13]

[0063] Auditory masking value calculation section 303
then finds geometric mean μi^g by means of Equation (14).

[0064]

$\mu_i^g = 10^{\left( \sum_{k=bh_i}^{bl_i} \log_{10} P_k \right) / (bl_i - bh_i)} \quad (i = 0, \ldots, m-1)$ [Equation 14]

[0065] Auditory masking value calculation section 303
then finds arithmetic mean μi^a by means of Equation (15).

[0066]

$\mu_i^a = \sum_{k=bh_i}^{bl_i} P_k / (bl_i - bh_i) \quad (i = 0, \ldots, m-1)$ [Equation 15]

[0067] Auditory masking value calculation section 303
then finds SFMi (Spectral Flatness Measure) by means of
Equation (16).

[0068]

$SFM_i = \mu_i^g / \mu_i^a \quad (i = 0, \ldots, m-1)$ [Equation 16]

[0069] Auditory masking value calculation section 303
then finds constant αi by means of Equation (17).

[0070]

$\alpha_i = \min\left( \frac{10 \cdot \log_{10} SFM_i}{-60}, 1 \right) \quad (i = 0, \ldots, m-1)$ [Equation 17]

[0071] Auditory masking value calculation section 303
then finds offset value Oi for each critical bandwidth
by means of Equation (18).

[0072]

$O_i = \alpha_i \cdot (14.5 + i) + 5.5 \cdot (1 - \alpha_i) \quad (i = 0, \ldots, m-1)$ [Equation 18]
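
The chain from spectral flatness to offset value for a single critical bandwidth can be sketched as follows. This example rests on several assumptions: the means are taken over the number of samples passed in, the constant αi is the flatness in decibels divided by -60 and limited to 1 (the form used in the cited Johnston paper), and all power values are strictly positive. The function name is hypothetical.

```python
import math
from typing import List, Tuple


def tonality_offset(band_power: List[float], band_index: int) -> Tuple[float, float, float]:
    """Return (SFM, alpha, offset) for one critical bandwidth."""
    count = len(band_power)
    # Geometric mean, computed in the log domain.
    geometric = 10.0 ** (sum(math.log10(p) for p in band_power) / count)
    # Arithmetic mean.
    arithmetic = sum(band_power) / count
    sfm = geometric / arithmetic
    # Assumed form of the tonality constant: SFM in dB over -60, capped at 1.
    alpha = min(10.0 * math.log10(sfm) / -60.0, 1.0)
    # Offset value blending the tonal and noise-like cases.
    offset = alpha * (14.5 + band_index) + 5.5 * (1.0 - alpha)
    return sfm, alpha, offset


flat = tonality_offset([2.0, 2.0, 2.0, 2.0], band_index=3)
peaky = tonality_offset([1e6, 1e-6, 1e-6, 1e-6], band_index=3)
```

A flat (noise-like) band has a flatness near 1 and an offset of 5.5, while a band dominated by a single component reaches αi = 1 and an offset of 14.5 + i.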

[0073] Auditory masking value calculation section 303
then finds auditory masking value Ti for each critical
bandwidth by means of Equation (19).

[0074]

$T_i = \sqrt{10^{\log_{10}(C_i) - (O_i / 10)} / (bl_i - bh_i)} \quad (i = 0, \ldots, m-1)$ [Equation 19]

[0075] Auditory masking value calculation section 303
then finds auditory masking characteristic value Mk from
auditory masking value Ti and
minimum audible threshold value athk output from memory
buffer 305 by means of Equation (20), and outputs this
to vector quantization section 202.

[0076]

$M_k = \max(ath_k, T_i) \quad (k = bh_i, \ldots, bl_i, \; i = 0, \ldots, m-1)$ [Equation 20]
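
Equation (20) expands the per-bandwidth masking values into one value per frequency index, as the following sketch illustrates. The function name and the representation of each critical bandwidth as an inclusive (first index, last index) pair are assumptions made for the example.

```python
from typing import List, Tuple


def masking_characteristic(
    ath: List[float],
    band_masking: List[float],
    band_edges: List[Tuple[int, int]],
) -> List[float]:
    """For each index k in band i, take the larger of ath[k] and band_masking[i]."""
    m_values = list(ath)
    for t_i, (first, last) in zip(band_masking, band_edges):
        for k in range(first, last + 1):
            m_values[k] = max(ath[k], t_i)
    return m_values


m_k = masking_characteristic(
    ath=[5.0, 1.0, 1.0, 4.0],
    band_masking=[2.0, 3.0],
    band_edges=[(0, 1), (2, 3)],
)
```

Taking the maximum means a component is treated as inaudible if it falls below either the threshold in quiet or the masking value produced by neighbouring components.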

[0077] Next, codebook acquisition processing (step
S1603) and vector quantization processing (step S1604)
in vector quantization section 202 will be described in
detail using the process flowchart in FIG. 5.

[0078] Using shape codebook 204 and gain codebook 205,
vector quantization section 202 performs vector
quantization of MDCT coefficient Xk from MDCT coefficient
Xk output from quadrature transformation processing
section 201 and an auditory masking characteristic value
output from auditory masking characteristic value
calculation section 203, and outputs obtained coded
information 102 to transmission channel 103 in FIG. 1.

[0079] The codebooks will now be described. 

[0080] Shape codebook 204 is composed of previously
created Nj kinds of N-dimensional code vectors codek^j (j
= 0, ..., Nj-1, k = 0, ..., N-1), and gain codebook 205 is
composed of previously created Nd kinds of gain codes gain^d
(d = 0, ..., Nd-1).

[0081] In step 501, initialization is performed by
assigning 0 to code vector index j in shape codebook 204,
and a sufficiently large value to minimum error DistMIN.

[0082] In step 502, N-dimensional code vector codek^j (k
= 0, ..., N-1) is read from shape codebook 204.

[0083] In step 503, MDCT coefficient Xk output from
quadrature transformation processing section 201 is input,
and gain Gain of code vector codek^j (k = 0, ..., N-1) read
from shape codebook 204 in step 502 is found by means of
Equation (21).

[0084]

$Gain = \sum_{k=0}^{N-1} X_k \cdot code_k^j \Big/ \sum_{k=0}^{N-1} (code_k^j)^2$ [Equation 21]

[0085] In step 504, 0 is assigned to calc_count
indicating the number of executions of step 505.

[0086] In step 505, auditory masking characteristic
value Mk output from auditory masking characteristic value
calculation section 203 is input, and temporary gain tempk
(k = 0, ..., N-1) is found by means of Equation (22).

[0087]

$temp_k = \begin{cases} code_k^j & (|code_k^j \cdot Gain| \ge M_k) \\ 0 & (|code_k^j \cdot Gain| < M_k) \end{cases} \quad (k = 0, \ldots, N-1)$ [Equation 22]

[0088] In Equation (22), if k satisfies the condition
|codek^j · Gain| ≥ Mk, codek^j is assigned to temporary gain
tempk, and if k satisfies the condition |codek^j · Gain| < Mk,
0 is assigned to temporary gain tempk.

[0089] Then, in step 505, gain Gain for an element that
is greater than or equal to the auditory masking value
is found by means of Equation (23).

[0090]

$Gain = \sum_{k=0}^{N-1} X_k \cdot temp_k \Big/ \sum_{k=0}^{N-1} temp_k^2 \quad (k = 0, \ldots, N-1)$ [Equation 23]

[0091] If temporary gain tempk is 0 for all k's, 0 is
assigned to gain Gain. Also, coded value Rk is found from
gain Gain and codek^j by means of Equation (24).

[0092]

$R_k = Gain \cdot code_k^j \quad (k = 0, \ldots, N-1)$ [Equation 24]

[0093] In step 506, calc_count is incremented by 1.

[0094] In step 507, calc_count and a predetermined
non-negative integer Nc are compared, and the process flow
returns to step 505 if calc_count is a smaller value than
Nc, or proceeds to step 508 if calc_count is greater than
or equal to Nc. By repeatedly finding gain Gain in this
way, gain Gain can be converged to a suitable value.
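
Steps 503 through 507 can be sketched as the following loop. This is a minimal illustration under stated assumptions: the initial gain is taken as the least-squares gain of the code vector against the MDCT coefficients, the refined gain uses the same form restricted to unmasked elements, and the function and parameter names are hypothetical.

```python
from typing import List, Tuple


def refine_gain(
    x: List[float], code: List[float], mask: List[float], n_c: int
) -> Tuple[float, List[float]]:
    """Return (gain, coded values) for one code vector."""
    # Step 503: initial gain over all elements (assumed least-squares form).
    energy = sum(c * c for c in code)
    gain = sum(xk * c for xk, c in zip(x, code)) / energy if energy else 0.0
    # Steps 504-507: step 505 runs once, then repeats while calc_count < n_c.
    for _ in range(max(1, n_c)):
        # Keep only elements whose scaled magnitude reaches the masking value.
        temp = [c if abs(c * gain) >= mk else 0.0 for c, mk in zip(code, mask)]
        energy = sum(t * t for t in temp)
        # If every element is masked, the gain is set to 0.
        gain = sum(xk * t for xk, t in zip(x, temp)) / energy if energy else 0.0
    coded = [gain * c for c in code]
    return gain, coded


gain, coded = refine_gain(x=[2.0, 0.1], code=[1.0, 1.0], mask=[0.5, 5.0], n_c=2)
masked_gain, _ = refine_gain(x=[2.0, 0.1], code=[1.0, 1.0], mask=[10.0, 10.0], n_c=2)
```

In the first call the second element is masked, so the gain moves from 1.05 to 2.0 and fits the audible element exactly rather than compromising between both.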

[0095] In step 508, 0 is assigned to cumulative error
Dist, and 0 is also assigned to sample index k.

[0096] Next, in steps 509, 511, 512, and 514, case
determination is performed for the relative positional
relationship between auditory masking characteristic
value Mk, coded value Rk, and MDCT coefficient Xk, and
distance calculation is performed in step 510, 513, 515,
or 516 according to the case determination result.

[0097] This case determination according to the relative
positional relationship is shown in FIG. 6. In FIG. 6, a
white circle symbol (o) signifies an input signal MDCT
coefficient Xk, and a black circle symbol (•) signifies
a coded value Rk. The items shown in FIG. 6 show the special
characteristics of the present invention. The area
from +Mk through 0 to -Mk, where Mk is the auditory masking
characteristic value found by auditory masking
characteristic value calculation section 203, is referred
to as the auditory masking area, and high-quality results
that are perceptually closer to the input can be obtained
by changing the distance calculation method when input
signal MDCT coefficient Xk or coded value Rk is present in
this auditory masking area.

[0098] The distance calculation method in vector
quantization according to the present invention will now
be described. When neither input signal MDCT coefficient
Xk (o) nor coded value Rk (•) is present in the auditory
masking area, and input signal MDCT coefficient Xk and
coded value Rk have the same sign, as shown in "Case 1"
in FIG. 6, distance D11 between input signal MDCT
coefficient Xk (o) and coded value Rk (•) is simply
calculated. When one of input signal MDCT coefficient
Xk (o) and coded value Rk (•) is present in the auditory
masking area, as shown in "Case 3" and "Case 4" in FIG. 6,
the position within the auditory masking area is corrected
to an Mk value (or in some cases a -Mk value) and D31 or
D41 is calculated. When input signal MDCT coefficient
Xk (o) and coded value Rk (•) straddle the auditory masking
area, as shown in "Case 2" in FIG. 6, the
inter-auditory-masking-area distance is calculated as
β·D23 (where β is an arbitrary coefficient). When input
signal MDCT coefficient Xk (o) and coded value Rk (•) are
both present within the auditory masking area, as shown
in "Case 5" in FIG. 6, distance D51 is calculated as 0.

[0099] Next, processing in step 509 through step 517 for
each of the cases will be described.

[0100] In step 509, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, coded value Rk, and MDCT
coefficient Xk corresponds to "Case 1" in FIG. 6 is
determined by means of the conditional expression in
Equation (25).

[0101]

$(|X_k| \ge M_k)$ and $(|R_k| \ge M_k)$ and $(X_k \cdot R_k \ge 0)$ [Equation 25]

[0102] Equation (25) signifies a case in which the
absolute value of MDCT coefficient Xk and the absolute
value of coded value Rk are both greater than or equal
to auditory masking characteristic value Mk, and MDCT
coefficient Xk and coded value Rk have the same sign. If
auditory masking characteristic value Mk, MDCT
coefficient Xk, and coded value Rk satisfy the conditional
expression in Equation (25), the process flow proceeds
to step 510, and if they do not satisfy the conditional
expression in Equation (25), the process flow proceeds
to step 511.

[0103] In step 510, error Dist1 between coded value Rk
and MDCT coefficient Xk is found by means of Equation (26),
error Dist1 is added to cumulative error Dist, and the
process flow proceeds to step 517.

[0104]

$Dist_1 = D_{11} = |X_k - R_k|$ [Equation 26]

[0105] In step 511, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, coded value Rk, and MDCT
coefficient Xk corresponds to "Case 5" in FIG. 6 is
determined by means of the conditional expression in
Equation (27).

[0106]

$(|X_k| < M_k)$ and $(|R_k| < M_k)$ [Equation 27]

[0107] Equation (27) signifies a case in which the
absolute value of MDCT coefficient Xk and the absolute
value of coded value Rk are both less than
auditory masking characteristic value Mk. If auditory
masking characteristic value Mk, MDCT coefficient Xk, and
coded value Rk satisfy the conditional expression in
Equation (27), the error between coded value Rk and MDCT
coefficient Xk is taken to be 0, nothing is added to
cumulative error Dist, and the process flow proceeds to
step 517, whereas if they do not satisfy the conditional
expression in Equation (27), the process flow proceeds
to step 512.

[0108] In step 512, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, coded value Rk, and MDCT
coefficient Xk corresponds to "Case 2" in FIG. 6 is
determined by means of the conditional expression in
Equation (28).

[0109]

$(|X_k| \ge M_k)$ and $(|R_k| \ge M_k)$ and $(X_k \cdot R_k < 0)$ [Equation 28]

[0110] Equation (28) signifies a case in which the
absolute value of MDCT coefficient Xk and the absolute
value of coded value Rk are both greater than or equal
to auditory masking characteristic value Mk, and MDCT
coefficient Xk and coded value Rk have different signs.
If auditory masking characteristic value Mk, MDCT
coefficient Xk, and coded value Rk satisfy the conditional
expression in Equation (28), the process flow proceeds
to step 513, and if they do not satisfy the conditional
expression in Equation (28), the process flow proceeds
to step 514.

[0111] In step 513, error Dist2 between coded value Rk
and MDCT coefficient Xk is found by means of Equation (29),
error Dist2 is added to cumulative error Dist, and the
process flow proceeds to step 517.

[0112]

$Dist_2 = D_{21} + D_{22} + \beta \cdot D_{23}$ [Equation 29]

[0113] Here, β is a value set as appropriate according to
MDCT coefficient Xk, coded value Rk, and auditory masking
characteristic value Mk. A value of 1 or less is suitable
for β, and a numeric value found experimentally by subjective
evaluation may be used. D21, D22, and D23 are found by
means of Equation (30), Equation (31), and Equation (32),
respectively.

[0114]

$D_{21} = |X_k| - M_k$ [Equation 30]

[0115]

$D_{22} = |R_k| - M_k$ [Equation 31]

[0116]

$D_{23} = M_k \cdot 2$ [Equation 32]

[0117] In step 514, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, coded value Rk, and MDCT
coefficient Xk corresponds to "Case 3" in FIG. 6 is
determined by means of the conditional expression in
Equation (33).

[0118]

$(|X_k| \ge M_k)$ and $(|R_k| < M_k)$ [Equation 33]

[0119] Equation (33) signifies a case in which the
absolute value of MDCT coefficient Xk is greater than or
equal to auditory masking characteristic value Mk, and
the absolute value of coded value Rk is less than auditory
masking characteristic value Mk. If auditory masking
characteristic value Mk, MDCT coefficient Xk, and coded
value Rk satisfy the conditional expression in Equation
(33), the process flow proceeds to step 515, and if they
do not satisfy the conditional expression in Equation
(33), the process flow proceeds to step 516.

[0120] In step 515, error Dist3 between coded value Rk
and MDCT coefficient Xk is found by means of Equation (34),
error Dist3 is added to cumulative error Dist, and the
process flow proceeds to step 517.

[0121]

$Dist_3 = D_{31} = |X_k| - M_k$ [Equation 34]

[0122] In step 516, the relative positional relationship
between auditory masking characteristic value Mk, coded
value Rk, and MDCT coefficient Xk corresponds to "Case
4" in FIG. 6, and the conditional expression in Equation
(35) is satisfied.

[0123]

$(|X_k| < M_k)$ and $(|R_k| \ge M_k)$ [Equation 35]

[0124] Equation (35) signifies a case in which the
absolute value of MDCT coefficient Xk is less than auditory
masking characteristic value Mk, and the absolute value of
coded value Rk is greater than or equal to auditory masking
characteristic value Mk. In step 516, error Dist4 between
coded value Rk and MDCT coefficient Xk is found by means of
Equation (36), error Dist4 is added to cumulative error
Dist, and the process flow proceeds to step 517.

[0125]

$Dist_4 = D_{41} = |R_k| - M_k$ [Equation 36]

[0126] In step 517, k is incremented by 1.

[0127] In step 518, N and k are compared, and if k is
a smaller value than N, the process flow returns to step
509. If k has the same value as N, the process flow proceeds
to step 519.

[0128] In step 519, cumulative error Dist and minimum
error DistMIN are compared, and if cumulative error Dist
is a smaller value than minimum error DistMIN, the process
flow proceeds to step 520, whereas if cumulative error
Dist is greater than or equal to minimum error DistMIN,
the process flow proceeds to step 521.

[0129] In step 520, cumulative error Dist is assigned
to minimum error DistMIN, j is assigned to code_indexMIN,
and gain Gain is assigned to error minimum gain GainMIN,
and the process flow proceeds to step 521.

[0130] In step 521, j is incremented by 1. 
[0131] In step 522, total number of vectors Nj and j are 
compared, and if j is a smaller value than Nj , the process 

10 flow returns to step 502. If j is greater than or equal 
to Nj , the process flow proceeds to step 523. 
[0132] In step 523, Na kinds of gain code gain'^ (d = 0, 
A, Nd-1) are read f rom gain codebook 2 0 5 , and quantization 
gain error gainerr^ (d = 0, A, Na-l) is found by means 

15 of Equation (37) for all d's. 
[0133] 

gainerr '' = \paWf^^i^. - gain''^ (i/ ~ 0, ■ ■ • , A^j - I ) [ E q u a t i o n 3 7] 
[0134] Then, in step 523, d for which quantization gain
error gainerr^d (d = 0, ..., Nd-1) is a minimum is found,
and the found d is assigned to gain_indexMIN.
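
The gain quantization of step 523 amounts to a nearest-neighbour search in a scalar codebook. The sketch below assumes that the quantization gain error is the absolute difference between the error minimum gain and each gain code; the function name `quantize_gain` is hypothetical.

```python
def quantize_gain(gain_min, gain_codebook):
    """Return the index d whose gain code is closest to gain_min.

    Assumes the quantization gain error is |gain_min - gain_codebook[d]|.
    """
    errors = [abs(gain_min - g) for g in gain_codebook]
    # min() returns the first index on ties, so the lowest d wins.
    return min(range(len(errors)), key=errors.__getitem__)
```

For example, with gain codes 0.25, 0.5, 1.0 and 2.0, a gain of 0.7 is closest to 0.5, so the index selected is 1.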

[0135] In step 524, code_indexMIN that is the code vector
index for which cumulative error Dist is a minimum, and
gain_indexMIN found in step 523, are output to transmission
channel 103 in FIG. 1 as coded information 102, and
processing is terminated.

[0136] This completes the description of coding section 
101 processing. 
[0137] Next, voice/musical tone decoding apparatus 105
in FIG. 1 will be described using the detailed block diagram
in FIG. 7.

[0138] Shape codebook 204 and gain codebook 205 are the
same as those shown in FIG. 2.

[0139] Vector decoding section 701 has coded information
102 transmitted via transmission channel 103 as input,
and using code_indexMIN and gain_indexMIN as the coded
information, reads code vector codek^(code_indexMIN) (k = 0,
..., N-1) from shape codebook 204, and also reads gain code
gain^(gain_indexMIN) from gain codebook 205. Then vector
decoding section 701 multiplies gain^(gain_indexMIN) by
codek^(code_indexMIN) (k = 0, ..., N-1), and outputs
gain^(gain_indexMIN) x codek^(code_indexMIN) (k = 0, ..., N-1)
obtained as a result of the multiplication to quadrature
transformation processing section 702 as a decoded MDCT
coefficient.
[0140] Quadrature transformation processing section 702
has an internal buffer buf'k, and initializes this buffer
in accordance with Equation (38).

[0141]

$buf'_k = 0 \quad (k = 0, \ldots, N-1)$   [Equation 38]
[0142] Next, decoded MDCT coefficient gain^(gain_indexMIN) x
codek^(code_indexMIN) (k = 0, ..., N-1) output from MDCT coefficient
decoding section 701 is input, and decoded signal yn is
found by means of Equation (39).
[0143]
$(n = 0, \ldots, N-1)$   [Equation 39]

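[0144] Here, X'k is a vector linking decoded MDCT
coefficient gain^(gain_indexMIN) x codek^(code_indexMIN) (k = 0, ...,
N-1) and buffer buf'k, and is found by means of Equation
(40).
[0145]

$X'_k = \begin{cases} buf'_k & (k = 0, \ldots, N-1) \\ gain^{gain\_index_{MIN}} \cdot code_{k-N}^{code\_index_{MIN}} & (k = N, \ldots, 2N-1) \end{cases}$   [Equation 40]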

[0146] Buffer buf'k is then updated by means of Equation
(41).
[0147]

$buf'_k = gain^{gain\_index_{MIN}} \cdot code_k^{code\_index_{MIN}} \quad (k = 0, \ldots, N-1)$   [Equation 41]
[0148] Decoded signal yn is then output as output signal
106.
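
The table look-up, scaling, and buffer handling on the decoding side can be sketched as below. This is an illustration under stated assumptions only: the function name `decode_frame` is hypothetical, the linked vector is assumed to place the buffer contents first and the current decoded coefficients second, and the inverse transform that produces the decoded signal is deliberately left out.

```python
def decode_frame(code_index, gain_index, shape_codebook, gain_codebook, buf):
    """Rebuild the decoded MDCT coefficients for one frame.

    Returns (linked, new_buf):
      linked  -- 2N-sample vector, buffer first, current frame second
                 (this ordering is an assumption of the sketch)
      new_buf -- buffer contents to carry into the next frame
    """
    gain = gain_codebook[gain_index]
    decoded = [gain * c for c in shape_codebook[code_index]]
    linked = list(buf) + decoded
    new_buf = decoded            # buffer update after the frame is processed
    return linked, new_buf
```

The buffer is what makes consecutive frames overlap, so the first frame is decoded against an all-zero buffer.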

[0149] By thus providing a quadrature transformation
processing section that finds an input signal MDCT
coefficient, an auditory masking characteristic value
calculation section that finds an auditory masking
characteristic value, and a vector quantization section
that performs vector quantization using an auditory
masking characteristic value, and performing vector
quantization distance calculation according to the
relative positional relationship between an auditory
masking characteristic value, MDCT coefficient, and
quantized MDCT coefficient, it is possible to select a
suitable code vector that minimizes degradation of a
signal that has a large auditory effect, and to obtain
a high-quality output signal.

[0150] It is also possible to perform quantization in
vector quantization section 202 by applying acoustic
weighting filters for the distance calculations in
above-described Case 1 through Case 5.

[0151] Also, in this embodiment, a case has been
described in which MDCT coefficient coding is performed,
but the present invention can also be applied, and the
same kind of actions and effects can be obtained, in a
case in which post-transformation signal (frequency
parameter) coding is performed using Fourier transform,
discrete cosine transform (DCT), quadrature mirror
filter (QMF), or suchlike quadrature transformation.

[0152] Furthermore, in this embodiment, a case has been
described in which coding is performed by means of vector
quantization, but there are no restrictions on the coding
method in the present invention, and, for example, coding
may also be performed by means of divided vector
quantization or multi-stage vector quantization.

[0153] It is also possible for voice/musical tone coding
apparatus 101 to have the procedure shown in the flowchart
in FIG. 16 executed by a computer by means of a program.
[0154] As described above, by calculating an auditory
masking characteristic value from an input signal,
considering all relative positional relationships of MDCT
coefficient, coded value, and auditory masking
characteristic value, and applying a distance calculation
method suited to human hearing, it is possible to select
a suitable code vector that minimizes degradation of
a signal that has a large auditory effect, and to obtain
good decoded voice even when an input signal is decoded
at a low bit rate.

[0155] In Patent Literature 1, only "Case 5" in FIG. 6
is disclosed, but with the present invention, in addition
to this, by employing a distance calculation method that
takes an auditory masking characteristic value into
consideration for all combinations of relationships as
shown in "Case 2," "Case 3," and "Case 4," considering
all relative positional relationships of input signal
MDCT coefficient, coded value, and auditory masking
characteristic value, and applying a distance calculation
method suited to hearing, it is possible to obtain
higher-quality coded voice even when an input signal is
quantized at a low bit rate.

[0156] Also, the present invention is based on the fact
that, if vector quantization is performed with the distance
calculation left unchanged, actual audibility differs
between a case in which an input signal MDCT coefficient
or coded value is present within the auditory masking area
and a case in which the two are present on either side of
the auditory masking area. More natural audibility can
therefore be provided by changing the distance calculation
method when performing vector quantization.
[0157] (Embodiment 2)

In Embodiment 2 of the present invention, an example
is described in which vector quantization using the
auditory masking characteristic values described in
Embodiment 1 is applied to scalable coding.

[0158] In this embodiment, a case is described below in
which, in a two-layer voice coding and decoding method
composed of a base layer and enhancement layer, vector
quantization is performed using an auditory masking
characteristic value in the enhancement layer.

[0159] A scalable voice coding method is a method whereby
a voice signal is split into a plurality of layers based
on frequency characteristics and coding is performed.
Specifically, signals of each layer are calculated using
a residual signal representing the difference between
a lower layer input signal and a lower layer output signal.
On the decoding side, the signals of these layers are
added and a voice signal is decoded. This technique
enables sound quality to be controlled flexibly, and also
makes noise-tolerant voice signal transfer possible.
[0160] In this embodiment, a case in which the base layer
performs CELP type voice coding and decoding will be
described as an example.

[0161] FIG. 8 is a block diagram showing the configuration
of a coding apparatus and decoding apparatus that use
an MDCT coefficient vector quantization method according
to Embodiment 2 of the present invention. In FIG. 8, the
coding apparatus is composed of base layer coding section
801, base layer decoding section 803, and enhancement
layer coding section 805, and the decoding apparatus is
composed of base layer decoding section 808, enhancement
layer decoding section 810, and adding section 812.
[0162] Base layer coding section 801 codes an input
signal 800 using a CELP type voice coding method,
calculates base layer coded information 802, and outputs
this to base layer decoding section 803, and to base layer
decoding section 808 via transmission channel 807.

[0163] Base layer decoding section 803 decodes base layer
coded information 802 using a CELP type voice decoding
method, calculates base layer decoded signal 804, and
outputs this to enhancement layer coding section 805.

[0164] Enhancement layer coding section 805 has base
layer decoded signal 804 output by base layer decoding
section 803, and input signal 800, as input, codes the
residual signal of input signal 800 and base layer decoded
signal 804 by means of vector quantization using an
auditory masking characteristic value, and outputs
enhancement layer coded information 806 found by means
of quantization to enhancement layer decoding section
810 via transmission channel 807. Details of enhancement
layer coding section 805 will be given later herein.

[0165] Base layer decoding section 808 decodes base layer
coded information 802 using a CELP type voice decoding
method, and outputs a base layer decoded signal 809 found
by decoding to adding section 812.

[0166] Enhancement layer decoding section 810 decodes
enhancement layer coded information 806, and outputs
enhancement layer decoded signal 811 found by decoding
to adding section 812.

[0167] Adding section 812 adds together base layer
decoded signal 809 output from base layer decoding section
808 and enhancement layer decoded signal 811 output from
enhancement layer decoding section 810, and outputs the
voice/musical tone signal that is the addition result
as output signal 813.
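
The two-layer signal flow of FIG. 8 can be illustrated with a toy model. In the sketch below, plain uniform scalar quantizers stand in for both the CELP base layer and the enhancement layer vector quantization; that substitution, the step sizes, and all function names are assumptions made purely to show how the residual is formed on the coding side and how the layers are added on the decoding side.

```python
def quantize(signal, step):
    """Uniform scalar quantizer used as a stand-in for a real codec layer."""
    return [step * round(x / step) for x in signal]


def encode_two_layer(x, base_step=0.5, enh_step=0.1):
    """Return (base_decoded, enh_decoded) for input frame x."""
    base_decoded = quantize(x, base_step)                  # sections 801 and 803
    residual = [a - b for a, b in zip(x, base_decoded)]    # input minus base layer
    enh_decoded = quantize(residual, enh_step)             # sections 805 and 810
    return base_decoded, enh_decoded


def decode_two_layer(base_decoded, enh_decoded):
    """Adding section 812: sum of the two decoded layers."""
    return [b + e for b, e in zip(base_decoded, enh_decoded)]
```

A decoder that receives only the base layer still produces a coarse output; adding the enhancement layer reduces the remaining error to at most half of the finer step in this toy model.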

[0168] Next, base layer coding section 801 will be
described using the block diagram in FIG. 9.
[0169] Input signal 800 of base layer coding section 801
is input to a preprocessing section 901. Preprocessing
section 901 performs high pass filter processing that
removes a DC component, and waveform shaping processing
and pre-emphasis processing aiming at performance
improvement of subsequent coding processing, and outputs
the signal (Xin) that has undergone this processing to
LPC analysis section 902 and adding section 905.
[0170] LPC analysis section 902 performs linear
prediction analysis using Xin, and outputs the analysis
result (linear prediction coefficient) to LPC
quantization section 903. LPC quantization section 903
performs quantization processing of the linear prediction
coefficient (LPC) output from LPC analysis section 902,
outputs the quantized LPC to combining filter 904, and
also outputs a code (L) indicating the quantized LPC to
multiplexing section 914.

[0171] Using a filter coefficient based on the quantized
LPC, combining filter 904 generates a composite signal
by performing filter combining on a drive sound source
output from an adding section 911 described later herein,
and outputs the composite signal to adding section 905.
[0172] Adding section 905 calculates an error signal by
inverting the polarity of the composite signal and adding
it to Xin, and outputs the error signal to acoustic
weighting section 912.

[0173] Adaptive sound source codebook 906 stores a drive
sound source output by adding section 911 in a buffer,
extracts one frame's worth of samples from a past drive
sound source specified by a signal output from parameter
determination section 913 as an adaptive sound source
vector, and outputs this to multiplication section 909.
[0174] Quantization gain generation section 907 outputs
the quantization adaptive sound source gain and the
quantization fixed sound source gain specified by a
signal output from parameter determination section 913
to multiplication section 909 and multiplication section
910, respectively.

[0175] Fixed sound source codebook 908 multiplies a pulse
sound source vector having a form specified by a signal
output from parameter determination section 913 by a
spreading vector, and outputs the obtained fixed sound
source vector to multiplication section 910.
[0176] Multiplication section 909 multiplies the
quantization adaptive sound source gain output from
quantization gain generation section 907 by the adaptive
sound source vector output from adaptive sound source
codebook 906, and outputs the result to adding section
911. Multiplication section 910 multiplies the
quantization fixed sound source gain output from
quantization gain generation section 907 by the fixed
sound source vector output from fixed sound source
codebook 908, and outputs the result to adding section
911.

[0177] Adding section 911 has as input the
post-gain-multiplication adaptive sound source vector
and fixed sound source vector from multiplication section
909 and multiplication section 910 respectively, and
outputs the drive sound source that is the addition result
to combining filter 904 and adaptive sound source codebook
906. The drive sound source input to adaptive sound
source codebook 906 is stored in a buffer.
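
The path from the two codebook vectors to the composite signal can be sketched as follows. This is a minimal illustration, assuming that the combining filter is an all-pole filter driven by the quantized LPC with the sign convention s[n] = e[n] - sum(a_i * s[n-i]); that convention and the function names are assumptions of the sketch, not details taken from the description.

```python
def drive_sound_source(adaptive_vec, fixed_vec, gain_adaptive, gain_fixed):
    """Multiplication sections 909/910 followed by adding section 911."""
    return [gain_adaptive * a + gain_fixed * f
            for a, f in zip(adaptive_vec, fixed_vec)]


def synthesis_filter(excitation, lpc, state=None):
    """All-pole filtering of the drive sound source (combining filter 904).

    lpc   -- coefficients a_1..a_p (sign convention assumed, see lead-in)
    state -- previous outputs s[n-1], s[n-2], ...; zeros if omitted
    """
    p = len(lpc)
    history = list(state) if state is not None else [0.0] * p
    out = []
    for e in excitation:
        s = e - sum(a * h for a, h in zip(lpc, history))
        out.append(s)
        if p:
            history = [s] + history[:-1]   # newest output sample goes first
    return out
```

With a single coefficient of -0.5, a unit impulse decays as 1, 0.5, 0.25, which is the expected response of a first-order recursive filter.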

[0178] Acoustic weighting section 912 performs acoustic 
weighting on the error signal output from adding section 
905, and outputs the result to parameter determination 
section 913 as coding distortion. 

[0179] Parameter determination section 913 selects from
adaptive sound source codebook 906, fixed sound source
codebook 908, and quantization gain generation section
907, the adaptive sound source vector, fixed sound source
vector, and quantization gain that minimize coding
distortion output from acoustic weighting section 912,
and outputs an adaptive sound source vector code (A),
sound source gain code (G), and fixed sound source vector
code (F) indicating the selection results to multiplexing
section 914.

[0180] Multiplexing section 914 has a code (L) indicating
quantized LPC as input from LPC quantization section 903,
and code (A) indicating an adaptive sound source vector,
code (F) indicating a fixed sound source vector, and code
(G) indicating quantization gain as input from parameter
determination section 913, multiplexes this information,
and outputs the result as base layer coded information
802.

[0181] Base layer decoding section 803 (808) will now 
be described using FIG. 10. 

[0182] In FIG. 10, base layer coded information 802 input
to base layer decoding section 803 (808) is separated
into individual codes (L, A, G, F) by demultiplexing
section 1001. Separated LPC code (L) is output to LPC
decoding section 1002, separated adaptive sound source
vector code (A) is output to adaptive sound source codebook
1005, separated sound source gain code (G) is output to
quantization gain generation section 1006, and separated
fixed sound source vector code (F) is output to fixed
sound source codebook 1007.
[0183] LPC decoding section 1002 decodes a quantized LPC
from code (L) output from demultiplexing section 1001,
and outputs the result to combining filter 1003.
[0184] Adaptive sound source codebook 1005 extracts one
frame's worth of samples from a past drive sound source
designated by code (A) output from demultiplexing section
1001 as an adaptive sound source vector, and outputs this
to multiplication section 1008.

[0185] Quantization gain generation section 1006
decodes quantization adaptive sound source gain and
quantization fixed sound source gain designated by sound
source gain code (G) output from demultiplexing section
1001, and outputs these to multiplication section 1008
and multiplication section 1009, respectively.

[0186] Fixed sound source codebook 1007 generates a fixed 
sound source vector designated by code (F) output from 
demultiplexing section 1001, and outputs this to 
multiplication section 1009. 

[0187] Multiplication section 1008 multiplies the
adaptive sound source vector by the quantization adaptive
sound source gain, and outputs the result to adding section
1010. Multiplication section 1009 multiplies the fixed
sound source vector by the quantization fixed sound source
gain, and outputs the result to adding section 1010.
[0188] Adding section 1010 performs addition of the
post-gain-multiplication adaptive sound source vector
and fixed sound source vector output from multiplication
section 1008 and multiplication section 1009, generates
a drive sound source, and outputs this to combining filter
1003 and adaptive sound source codebook 1005.
[0189] Using the filter coefficient decoded by LPC
decoding section 1002, combining filter 1003 performs
filter combining of the drive sound source output from
adding section 1010, and outputs the combined signal to
postprocessing section 1004.

[0190] Postprocessing section 1004 executes, on the
signal output from combining filter 1003, processing that
improves the subjective voice sound quality such as
formant emphasis and pitch emphasis, processing that
improves the subjective sound quality of stationary noise,
and so forth, and outputs the resulting signal as base
layer decoded signal 804 (809).

[0191] Enhancement layer coding section 805 will now be
described using FIG. 11.

[0192] Enhancement layer coding section 805 in FIG. 11
is similar to that shown in FIG. 2, except that differential
signal 1102 of base layer decoded signal 804 and input
signal 800 is input to quadrature transformation
processing section 1103. Auditory masking
characteristic value calculation section 203 is assigned
the same code as in FIG. 2 and is not described here.
[0193] As with coding section 101 of Embodiment 1,
enhancement layer coding section 805 divides input signal
800 into sections of N samples (where N is a natural number),
takes N samples as one frame, and performs coding on a
frame-by-frame basis. Here, input signal 800 subject to
coding will be designated xn (n = 0, ..., N-1).
[0194] Input signal xn 800 is input to auditory masking
characteristic value calculation section 203 and adding
section 1101. Also, base layer decoded signal 804 output
from base layer decoding section 803 is input to adding
section 1101 and quadrature transformation processing
section 1103.

[0195] Adding section 1101 finds residual signal 1102
xresidn (n = 0, ..., N-1) by means of Equation (42), and
outputs residual signal 1102 xresidn to quadrature
transformation processing section 1103.

[0196]

$xresid_n = x_n - xbase_n \quad (n = 0, \ldots, N-1)$   [Equation 42]
[0197] Here, xbasen (n = 0, ..., N-1) is base layer decoded
signal 804. Next, the process performed by quadrature
transformation processing section 1103 will be described.

[0198] Quadrature transformation processing section
1103 has internal buffers bufbasen (n = 0, ..., N-1) used
in base layer decoded signal xbasen 804 processing, and
bufresidn (n = 0, ..., N-1) used in residual signal xresidn
1102 processing, and initializes these buffers by means
of Equation (43) and Equation (44) respectively.
[0199]

$bufbase_n = 0 \quad (n = 0, \ldots, N-1)$   [Equation 43]
[0200]

$bufresid_n = 0 \quad (n = 0, \ldots, N-1)$   [Equation 44]
[0201] Quadrature transformation processing section
1103 then finds base layer quadrature transformation
coefficient Xbasek 1104 and residual quadrature
transformation coefficient Xresidk 1105 by performing a
modified discrete cosine transform (MDCT) on base layer
decoded signal xbasen 804 and residual signal xresidn 1102,
respectively. Base layer quadrature transformation
coefficient Xbasek 1104 here is found by means of Equation
(45).
[0202]

$Xbase_k = \frac{2}{N} \sum_{n=0}^{2N-1} xbase'_n \cos\left[\frac{(2n+1+N)(2k+1)\pi}{4N}\right] \quad (k = 0, \ldots, N-1)$   [Equation 45]

[0203] Here, xbase'n is a vector linking base layer
decoded signal xbasen 804 and buffer bufbasen, and
quadrature transformation processing section 1103 finds
xbase'n by means of Equation (46). Also, k is the index
of each sample in one frame.

[0204]

$xbase'_n = \begin{cases} bufbase_n & (n = 0, \ldots, N-1) \\ xbase_{n-N} & (n = N, \ldots, 2N-1) \end{cases}$   [Equation 46]

[0205] Next, quadrature transformation processing
section 1103 updates buffer bufbasen by means of Equation
(47).
[0206]

$bufbase_n = xbase_n \quad (n = 0, \ldots, N-1)$   [Equation 47]
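[0207] Also, quadrature transformation processing
section 1103 finds residual quadrature transformation
coefficient Xresidk 1105 by means of Equation (48).
[0208]

$Xresid_k = \frac{2}{N} \sum_{n=0}^{2N-1} xresid'_n \cos\left[\frac{(2n+1+N)(2k+1)\pi}{4N}\right] \quad (k = 0, \ldots, N-1)$   [Equation 48]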

[0209] Here, xresid'n is a vector linking residual signal
xresidn 1102 and buffer bufresidn, and quadrature
transformation processing section 1103 finds xresid'n by
means of Equation (49). Also, k is the index of each sample
in one frame.

[0210]

$xresid'_n = \begin{cases} bufresid_n & (n = 0, \ldots, N-1) \\ xresid_{n-N} & (n = N, \ldots, 2N-1) \end{cases}$   [Equation 49]

[0211] Next, quadrature transformation processing
section 1103 updates buffer bufresidn by means of Equation
(50).
[0212]

$bufresid_n = xresid_n \quad (n = 0, \ldots, N-1)$   [Equation 50]

[0213] Quadrature transformation processing section
1103 then outputs base layer quadrature transformation
coefficient Xbasek 1104 and residual quadrature
transformation coefficient Xresidk 1105 to vector
quantization section 1106.
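
The per-frame transform with its buffer can be sketched as follows. The sketch assumes the usual MDCT form with a 2/N scale factor and the phase term (2n+1+N)(2k+1)π/4N, and assumes that the linked 2N-sample vector places the buffer first and the current frame second; the function name `mdct_frame` is hypothetical. The same routine would serve for both the base layer decoded signal and the residual signal, each with its own buffer.

```python
import math


def mdct_frame(frame, buf):
    """Transform one N-sample frame; returns (coeffs, new_buf).

    The transform form and the buffer-first ordering are assumptions of
    this sketch (see lead-in).
    """
    n = len(frame)
    linked = list(buf) + list(frame)      # 2N samples: previous frame, current frame
    coeffs = []
    for k in range(n):
        acc = 0.0
        for i, x in enumerate(linked):
            acc += x * math.cos((2 * i + 1 + n) * (2 * k + 1) * math.pi / (4 * n))
        coeffs.append(2.0 / n * acc)
    return coeffs, list(frame)            # buffer update: keep the current frame
```

Calling the function frame by frame and feeding the returned buffer back in gives the 50 percent overlap between successive transform windows.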
[0214] Vector quantization section 1106 has, as input,
base layer quadrature transformation coefficient Xbasek
1104 and residual quadrature transformation coefficient
Xresidk 1105 from quadrature transformation processing
section 1103, and auditory masking characteristic value
Mk 1107 from auditory masking characteristic value
calculation section 203, and using shape codebook 1108
and gain codebook 1109, performs coding of residual
quadrature transformation coefficient Xresidk 1105 by
means of vector quantization using the auditory masking
characteristic value, and outputs enhancement layer coded
information 806 obtained by coding.

[0215] Here, shape codebook 1108 is composed of
previously created Ne kinds of N-dimensional code vectors
coderesidk^e (e = 0, ..., Ne-1, k = 0, ..., N-1), and is used
when performing vector quantization of residual
quadrature transformation coefficient Xresidk 1105 in
vector quantization section 1106.

[0216] Also, gain codebook 1109 is composed of previously
created Nf kinds of residual gain codes gainresid^f (f =
0, ..., Nf-1), and is used when performing vector
quantization of residual quadrature transformation
coefficient Xresidk 1105 in vector quantization section
1106.

[0217] The process performed by vector quantization
section 1106 will now be described in detail using FIG. 12.
In step 1201, initialization is performed by assigning
0 to code vector index e in shape codebook 1108, and a
sufficiently large value to minimum error DistMIN.
[0218] In step 1202, N-dimensional code vector
coderesidk^e (k = 0, ..., N-1) is read from shape codebook
1108.

[0219] In step 1203, residual quadrature transformation
coefficient Xresidk output from quadrature
transformation processing section 1103 is input, and gain
Gainresid of code vector coderesidk^e (k = 0, ..., N-1) read
in step 1202 is found by means of Equation (51).
[0220]

$Gainresid = \sum_{k=0}^{N-1} Xresid_k \cdot coderesid_k^{e} \Big/ \sum_{k=0}^{N-1} \left(coderesid_k^{e}\right)^2$   [Equation 51]

[0221] In step 1204, 0 is assigned to calc_countresid 
indicating the number of executions of step 1205. 

[0222] In step 1205, auditory masking characteristic
value Mk output from auditory masking characteristic value
calculation section 203 is input, and temporary gain
temp2k (k = 0, ..., N-1) is found by means of Equation (52).

[0223]

$temp2_k = \begin{cases} coderesid_k^{e} & (|coderesid_k^{e} \cdot Gainresid + Xbase_k| \ge M_k) \\ 0 & (|coderesid_k^{e} \cdot Gainresid + Xbase_k| < M_k) \end{cases} \quad (k = 0, \ldots, N-1)$   [Equation 52]

[0224] In Equation (52), if k satisfies the condition
|coderesidk^e · Gainresid + Xbasek| ≥ Mk, coderesidk^e is
assigned to temporary gain temp2k, and if k satisfies the
condition |coderesidk^e · Gainresid + Xbasek| < Mk, 0 is
assigned to temp2k. Here, k is the index of each sample
in one frame.

[0225] Then, in step 1205, gain Gainresid is found by
means of Equation (53).
[0226]

$Gainresid = \sum_{k=0}^{N-1} Xresid_k \cdot temp2_k \Big/ \sum_{k=0}^{N-1} temp2_k^{2} \quad (k = 0, \ldots, N-1)$   [Equation 53]

[0227] If temporary gain temp2k is 0 for all k's, 0 is
assigned to gain Gainresid. Also, residual coded value
Rresidk is found from gain Gainresid and code vector
coderesidk^e by means of Equation (54).
[0228]

$Rresid_k = Gainresid \cdot coderesid_k^{e} \quad (k = 0, \ldots, N-1)$   [Equation 54]
[0229] Also, addition coded value Rplusk is found from
residual coded value Rresidk and base layer quadrature
transformation coefficient Xbasek by means of
Equation (55).
[0230]

$Rplus_k = Rresid_k + Xbase_k \quad (k = 0, \ldots, N-1)$   [Equation 55]
[0231] In step 1206, calc_countresid is incremented by
1.

[0232] In step 1207, calc_countresid and a predetermined
non-negative integer Nresidc are compared, and the process
flow returns to step 1205 if calc_countresid is a smaller
value than Nresidc, or proceeds to step 1208 if
calc_countresid is greater than or equal to Nresidc.
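
Steps 1203 through 1207 can be sketched as a small iteration. The sketch below is one reading of the procedure, with the gain formulas taken as ratios of sums; the function name `refine_gain`, the argument names, and the zero-denominator guard applied to the initial gain are assumptions.

```python
def refine_gain(xresid, xbase, code, mask, n_iter):
    """Return (gain, rresid, rplus) for one residual code vector.

    xresid, xbase -- residual and base layer MDCT coefficients
    code          -- residual shape code vector
    mask          -- auditory masking characteristic values
    n_iter        -- the predetermined non-negative integer (Nresidc)
    """
    denom = sum(c * c for c in code)
    # Initial gain; the guard against an all-zero code vector is an assumption.
    gain = sum(x * c for x, c in zip(xresid, code)) / denom if denom else 0.0
    # Step 1205 runs once before the count is tested in step 1207.
    for _ in range(max(1, n_iter)):
        # Keep only the elements whose coded value reaches the masking value.
        temp2 = [c if abs(c * gain + xb) >= m else 0.0
                 for c, xb, m in zip(code, xbase, mask)]
        denom = sum(t * t for t in temp2)
        gain = sum(x * t for x, t in zip(xresid, temp2)) / denom if denom else 0.0
    rresid = [gain * c for c in code]                   # residual coded value
    rplus = [r + xb for r, xb in zip(rresid, xbase)]    # addition coded value
    return gain, rresid, rplus
```

The effect of the iteration is that elements falling below the masking value no longer influence the gain, so the gain is fitted only to the components that remain audible.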

[0233] In step 1208, 0 is assigned to cumulative error
Distresid, and 0 is also assigned to sample index k. Also,
in step 1208, addition MDCT coefficient Xplusk is found
by means of Equation (56).
[0234]

$Xplus_k = Xbase_k + Xresid_k \quad (k = 0, \ldots, N-1)$   [Equation 56]

[0235] Next, in steps 1209, 1211, 1212, and 1214, case
determination is performed for the relative positional
relationship between auditory masking characteristic
value Mk 1107, addition coded value Rplusk, and addition
MDCT coefficient Xplusk, and distance calculation is
performed in step 1210, 1213, 1215, or 1216 according
to the case determination result. This case
determination according to the relative positional
relationship is shown in FIG. 13. In FIG. 13, a white
circle symbol (o) signifies an addition MDCT coefficient
Xplusk, and a black circle symbol (•) signifies an addition
coded value Rplusk. The concepts in FIG. 13 are the same
as explained for FIG. 6 in Embodiment 1.
[0236] In step 1209, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, addition coded value Rplusk, and
addition MDCT coefficient Xplusk corresponds to "Case 1"
in FIG. 13 is determined by means of the conditional
expression in Equation (57).

[0237]

$(|Xplus_k| \ge M_k) \text{ and } (|Rplus_k| \ge M_k) \text{ and } (Xplus_k \cdot Rplus_k \ge 0)$   [Equation 57]
[0238] Equation (57) signifies a case in which the
absolute value of addition MDCT coefficient Xplusk and
the absolute value of addition coded value Rplusk are both
greater than or equal to auditory masking characteristic
value Mk, and addition MDCT coefficient Xplusk and addition
coded value Rplusk have the same sign. If auditory masking
characteristic value Mk, addition MDCT coefficient Xplusk,
and addition coded value Rplusk satisfy the conditional
expression in Equation (57), the process flow proceeds
to step 1210, and if they do not satisfy the conditional
expression in Equation (57), the process flow proceeds
to step 1211.

[0239] In step 1210, error Distresid1 between Rplusk and
addition MDCT coefficient Xplusk is found by means of
Equation (58), error Distresid1 is added to cumulative
error Distresid, and the process flow proceeds to step
1217.
[0240]

$Distresid_1 = Dresid_{11} = |Xresid_k - Rresid_k|$   [Equation 58]

[0241] In step 1211, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, addition coded value Rplusk, and
addition MDCT coefficient Xplusk corresponds to "Case 5"
in FIG. 13 is determined by means of the conditional
expression in Equation (59).
[0242]
$(|Xplus_k| < M_k) \text{ and } (|Rplus_k| < M_k)$   [Equation 59]
[0243] Equation (59) signifies a case in which the
absolute value of addition MDCT coefficient Xplusk and
the absolute value of addition coded value Rplusk are both
less than auditory masking characteristic value Mk. If
auditory masking characteristic value Mk, addition coded
value Rplusk, and addition MDCT coefficient Xplusk satisfy
the conditional expression in Equation (59), the error
between addition coded value Rplusk and addition MDCT
coefficient Xplusk is taken to be 0, nothing is added to
cumulative error Distresid, and the process flow proceeds
to step 1217. If auditory masking characteristic value
Mk, addition coded value Rplusk, and addition MDCT
coefficient Xplusk do not satisfy the conditional
expression in Equation (59), the process flow proceeds
to step 1212.

[0244] In step 1212, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, addition coded value Rplusk, and
addition MDCT coefficient Xplusk corresponds to "Case 2"
in FIG. 13 is determined by means of the conditional
expression in Equation (60).
[0245]

$(|Xplus_k| \ge M_k) \text{ and } (|Rplus_k| \ge M_k) \text{ and } (Xplus_k \cdot Rplus_k < 0)$   [Equation 60]

[0246] Equation (60) signifies a case in which the
absolute value of addition MDCT coefficient Xplusk and
the absolute value of addition coded value Rplusk are both
greater than or equal to auditory masking characteristic
value Mk, and addition MDCT coefficient Xplusk and addition
coded value Rplusk have different signs. If auditory
masking characteristic value Mk, addition MDCT
coefficient Xplusk, and addition coded value Rplusk
satisfy the conditional expression in Equation (60), the
process flow proceeds to step 1213, and if they do not
satisfy the conditional expression in Equation (60), the
process flow proceeds to step 1214.

[0247] In step 1213, error Distresid2 between addition
coded value Rplusk and addition MDCT coefficient Xplusk
is found by means of Equation (61), error Distresid2 is
added to cumulative error Distresid, and the process flow
proceeds to step 1217.
[0248]

$Distresid_2 = Dresid_{21} + Dresid_{22} + \beta_{resid} \cdot Dresid_{23}$   [Equation 61]
[0249] Here, βresid is a value set as appropriate according
to addition MDCT coefficient Xplusk, addition coded value
Rplusk, and auditory masking characteristic value Mk. A
value of 1 or less is suitable for βresid. Dresid21, Dresid22,
and Dresid23 are found by means of Equation (62), Equation
(63), and Equation (64), respectively.
[0250]

$Dresid_{21} = |Xplus_k| - M_k$   [Equation 62]

[0251]

$Dresid_{22} = |Rplus_k| - M_k$   [Equation 63]

[0252]

$Dresid_{23} = M_k \cdot 2$   [Equation 64]
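
The "Case 2" distance can be worked through numerically. The sketch below assumes that the three partial distances are the part of the MDCT coefficient lying outside the masking area, the part of the coded value lying outside the masking area, and the width of the masking area (twice the masking value) weighted by a factor of 1 or less; the function name `case2_distance` and the parameter name `beta` are hypothetical.

```python
def case2_distance(xplus, rplus, mask, beta=1.0):
    """Distance for coefficients on opposite sides of the masking area.

    Assumes abs(xplus) >= mask, abs(rplus) >= mask and opposite signs.
    """
    d21 = abs(xplus) - mask      # part of the coefficient outside the masking area
    d22 = abs(rplus) - mask      # part of the coded value outside the masking area
    d23 = mask * 2               # width of the masking area itself
    return d21 + d22 + beta * d23
```

Under this reading, a weight of 1 reproduces the plain distance between the two values (for example 3 and -2 with a masking value of 1 give 5), while a smaller weight discounts the stretch that lies inside the masking area (a weight of 0.5 gives 4).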
[0253] In step 1214, whether or not the relative
positional relationship between auditory masking
characteristic value Mk, addition coded value Rplusk, and
addition MDCT coefficient Xplusk corresponds to "Case 3"
in FIG. 13 is determined by means of the conditional
expression in Equation (65).
[0254]

$(|Xplus_k| \ge M_k) \text{ and } (|Rplus_k| < M_k)$   [Equation 65]
[0255] Equation (65) signifies a case in which the
absolute value of addition MDCT coefficient Xplusk is
greater than or equal to auditory masking characteristic
value Mk, and addition coded value Rplusk is less than
auditory masking characteristic value Mk. If auditory
masking characteristic value Mk, addition MDCT
coefficient Xplusk, and addition coded value Rplusk
satisfy the conditional expression in Equation (65), the
process flow proceeds to step 1215, and if they do not
satisfy the conditional expression in Equation (65), the
process flow proceeds to step 1216.

[0256] In step 1215, error Distresid_3 between addition
coded value Rplus_k and addition MDCT coefficient Xplus_k
is found by means of Equation (66), error Distresid_3 is
added to cumulative error Distresid, and the process flow
proceeds to step 1217.
[0257]

$\mathit{Distresid}_3 = \mathit{Dresid}_{31}$ [Equation 66]

[0258] In step 1216, the relative positional
relationship between auditory masking characteristic
value M_k, addition coded value Rplus_k, and addition MDCT
coefficient Xplus_k corresponds to "Case 4" in FIG. 13, and
the conditional expression in Equation (67) is satisfied.
[0259]

$(|\mathit{Xplus}_k| < M_k)\ \text{and}\ (|\mathit{Rplus}_k| \geq M_k)$ [Equation 67]
[0260] Equation (67) signifies a case in which the
absolute value of addition MDCT coefficient Xplus_k is less
than auditory masking characteristic value M_k, and
addition coded value Rplus_k is greater than or equal to
auditory masking characteristic value M_k. In step 1216,
error Distresid_4 between addition coded value Rplus_k and
addition MDCT coefficient Xplus_k is found by means of
Equation (68), error Distresid_4 is added to cumulative
error Distresid, and the process flow proceeds to step
1217.
[0261]

$\mathit{Distresid}_4 = \mathit{Dresid}_{41}$ [Equation 68]
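The branch between "Case 3" and "Case 4" can be sketched as follows. This is only an illustration under stated assumptions: the passage here does not spell out Dresid_31 and Dresid_41, so the sketch assumes, by analogy with Equations (62) and (63), that each is the amount by which the value lying outside the masked band exceeds M_k. It also compares absolute values on both sides, and the function name is hypothetical.

```python
def case3_or_case4_distance(xplus_k, rplus_k, m_k):
    """Per-coefficient error for "Case 3" (step 1215) or "Case 4" (step 1216)."""
    x_outside = abs(xplus_k) >= m_k   # Xplus_k reaches or exceeds the mask
    r_outside = abs(rplus_k) >= m_k   # Rplus_k reaches or exceeds the mask
    if x_outside and not r_outside:   # Equation (65): "Case 3"
        return abs(xplus_k) - m_k     # ASSUMED form of Dresid_31
    if r_outside and not x_outside:   # Equation (67): "Case 4"
        return abs(rplus_k) - m_k     # ASSUMED form of Dresid_41
    raise ValueError("coefficient belongs to another case")


print(case3_or_case4_distance(3.0, 0.5, 1.0))    # Case 3
print(case3_or_case4_distance(0.2, -2.5, 1.0))   # Case 4
```

In the described flow, step 1216 is reached by elimination after the earlier tests fail; the explicit Case 4 test here only makes the sketch safe to call on its own.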

[0262] In step 1217, k is incremented by 1.
[0263] In step 1218, N and k are compared, and if k is
a smaller value than N, the process flow returns to step
1209. If k is greater than or equal to N, the process
flow proceeds to step 1219.

[0264] In step 1219, cumulative error Distresid and
minimum error Distresid_MIN are compared, and if cumulative
error Distresid is a smaller value than minimum error
Distresid_MIN, the process flow proceeds to step 1220,
whereas if cumulative error Distresid is greater than
or equal to minimum error Distresid_MIN, the process flow
proceeds to step 1221.

[0265] In step 1220, cumulative error Distresid is
assigned to minimum error Distresid_MIN, e is assigned to
coderesid_index_MIN, the gain is assigned to
error-minimum gain Gainresid_MIN, and the process flow
proceeds to step 1221.

[0266] In step 1221, e is incremented by 1.

[0267] In step 1222, total number of vectors N_e and e
are compared, and if e is a smaller value than N_e, the
process flow returns to step 1202. If e is greater than
or equal to N_e, the process flow proceeds to step 1223.
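The two nested loops closed by steps 1217 to 1222 amount to an exhaustive search that keeps the code vector with the smallest cumulative error. The sketch below illustrates that control flow only. Its interface is hypothetical: each candidate is supplied as a precomputed (gain, Rplus) pair, the per-coefficient error is passed in as a function, and the starting value of the minimum error is assumed to be infinity.

```python
def search_shape_codebook(xplus, masks, candidates, coeff_distance):
    """Return (index_min, gain_min, dist_min) over all candidate code vectors.

    candidates[e] is a hypothetical (gain, rplus) pair for code vector e;
    coeff_distance(x, r, m) returns the error for one coefficient.
    """
    dist_min = float("inf")   # assumed initial value of the minimum error
    index_min = None
    gain_min = None
    for e, (gain, rplus) in enumerate(candidates):   # outer loop, closed by steps 1221-1222
        dist = 0.0                                   # cumulative error Distresid
        for k in range(len(xplus)):                  # inner loop, closed by steps 1217-1218
            dist += coeff_distance(xplus[k], rplus[k], masks[k])
        if dist < dist_min:                          # step 1219: strictly smaller only
            dist_min = dist                          # step 1220: remember the best so far
            index_min = e
            gain_min = gain
    return index_min, gain_min, dist_min


# Toy run with a plain absolute difference standing in for the case-based error.
toy = search_shape_codebook(
    xplus=[1.0, 2.0],
    masks=[0.1, 0.1],
    candidates=[(1.0, [0.0, 0.0]), (2.0, [1.0, 1.5]), (3.0, [1.0, 2.5])],
    coeff_distance=lambda x, r, m: abs(x - r),
)
print(toy)
```

Because step 1219 updates only on a strictly smaller error, the earliest of several equally good code vectors is the one retained.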
[0268] In step 1223, N_f kinds of residual gain code
gainresid^f (f = 0, ..., N_f-1) are read from gain codebook
1109, and quantization residual gain error gainresiderr^f
(f = 0, ..., N_f-1) is found by means of Equation (69) for
all values of f.
[0269]

$\mathit{gainresiderr}^f = |\mathit{Gainresid}_{MIN} - \mathit{gainresid}^f| \quad (f = 0, \ldots, N_f - 1)$ [Equation 69]
[0270] Then, in step 1223, f for which quantization
residual gain error gainresiderr^f (f = 0, ..., N_f-1) is a
minimum is found, and the found f is assigned to
gainresid_index_MIN.
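Step 1223 is a nearest-neighbour lookup in a scalar codebook. A minimal sketch follows, assuming the error in Equation (69) is the absolute difference between the error-minimum gain and each residual gain code; the function name and the example codebook values are illustrative only.

```python
def quantize_gain(gain_min, gain_codebook):
    """Return the index f whose gain code is closest to gain_min."""
    errors = [abs(gain_min - g) for g in gain_codebook]   # Equation (69), assumed absolute difference
    # The first index attaining the smallest error is returned.
    return min(range(len(errors)), key=errors.__getitem__)


print(quantize_gain(0.8, [0.25, 0.5, 1.0, 2.0]))   # index of the code 1.0
```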

[0271] In step 1224, coderesid_index_MIN, which is the code
vector index for which cumulative error Distresid is a
minimum, and gainresid_index_MIN found in step 1223 are
output to transmission channel 807 as enhancement layer
coded information 806, and processing is terminated.
[0272] Next, enhancement layer decoding section 810 will
be described using the block diagram in FIG. 14. In the
same way as shape codebook 1108, shape codebook 1403 is
composed of N_e kinds of N-dimensional code vectors
coderesid_k^e (e = 0, ..., N_e-1, k = 0, ..., N-1), and in the
same way as gain codebook 1109, gain codebook 1404 is
composed of N_f kinds of residual gain codes gainresid^f
(f = 0, ..., N_f-1).

[0273] Vector decoding section 1401 has enhancement
layer coded information 806 transmitted via transmission
channel 807 as input, and using coderesid_index_MIN and
gainresid_index_MIN as the coded information, reads code
vector coderesid_k^{coderesid_index_MIN} (k = 0, ..., N-1) from shape
codebook 1403, and also reads code gainresid^{gainresid_index_MIN}
from gain codebook 1404. Then, vector decoding section
1401 multiplies gainresid^{gainresid_index_MIN} by
coderesid_k^{coderesid_index_MIN} (k = 0, ..., N-1), and outputs
gainresid^{gainresid_index_MIN} · coderesid_k^{coderesid_index_MIN} (k = 0,
..., N-1) obtained as a result of the multiplication to
residual quadrature transformation processing section
1402 as a decoded residual quadrature transformation
coefficient.
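The operation of vector decoding section 1401 reduces to two table lookups and a scalar multiplication. The sketch below illustrates this with codebooks held as plain Python lists; the function name, argument names, and example values are hypothetical.

```python
def decode_vector(shape_index, gain_index, shape_codebook, gain_codebook):
    """Return the decoded coefficients: gain code times shape code vector."""
    code = shape_codebook[shape_index]   # N-dimensional code vector from the shape codebook
    gain = gain_codebook[gain_index]     # scalar residual gain code from the gain codebook
    return [gain * c for c in code]      # one product per k = 0, ..., N-1


shapes = [[1.0, 0.0], [0.5, -1.0]]   # toy shape codebook with N = 2
gains = [2.0, 4.0]                   # toy gain codebook
print(decode_vector(1, 0, shapes, gains))
```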

[0274] The process performed by residual quadrature
transformation processing section 1402 will now be
described.

[0275] Residual quadrature transformation processing
section 1402 has an internal buffer bufresid'_k, and
initializes this buffer in accordance with Equation (70).
[0276]

$\mathit{bufresid}'_k = 0 \quad (k = 0, \ldots, N-1)$ [Equation 70]

[0277] Decoded residual quadrature transformation
coefficient gainresid^{gainresid_index_MIN} ·
coderesid_k^{coderesid_index_MIN} (k = 0, ..., N-1) output from vector
decoding section 1401 is input, and enhancement layer
decoded signal yresid_n 811 is found by means of Equation
(71).
[0278]

$\mathit{yresid}_n = \frac{2}{N} \sum_{k=0}^{2N-1} \mathit{Xresid}'_k \cos\left[\frac{(2n+1+N)(2k+1)\pi}{4N}\right] \quad (n = 0, \ldots, N-1)$ [Equation 71]

[0279] Here, Xresid'_k is a vector linking decoded
residual quadrature transformation coefficient
gainresid^{gainresid_index_MIN} · coderesid_k^{coderesid_index_MIN} (k = 0,
..., N-1) and buffer bufresid'_k, and is found by means of
Equation (72).
[0280]

$\mathit{Xresid}'_k = \begin{cases} \mathit{bufresid}'_k & (k = 0, \ldots, N-1) \\ \mathit{gainresid}^{\mathit{gainresid\_index}_{MIN}} \cdot \mathit{coderesid}_{k-N}^{\mathit{coderesid\_index}_{MIN}} & (k = N, \ldots, 2N-1) \end{cases}$ [Equation 72]



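[0281] Buffer bufresid'_k is then updated by means of
Equation (73).

[0282]

$\mathit{bufresid}'_k = \mathit{gainresid}^{\mathit{gainresid\_index}_{MIN}} \cdot \mathit{coderesid}_k^{\mathit{coderesid\_index}_{MIN}} \quad (k = 0, \ldots, N-1)$ [Equation 73]

[0283] Enhancement layer decoded signal yresid_n 811 is
then output.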
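The buffer handling around Equations (70) to (73) can be sketched as a small stateful object: the buffer starts at zero, each frame is formed by linking the buffer and the newly decoded coefficients, the frame is transformed, and the buffer is then overwritten with the new coefficients. The class and method names are hypothetical, and the cosine kernel and 2/N scaling used for the transform are an assumption of this sketch that follows a common MDCT synthesis form.

```python
import math


class ResidualInverseTransform:
    """Sketch of residual quadrature transformation processing section 1402."""

    def __init__(self, n):
        self.n = n
        self.buf = [0.0] * n                 # Equation (70): buffer initialised to zero

    def process(self, decoded):
        """Take N decoded coefficients and return N output samples."""
        n = self.n
        x = self.buf + list(decoded)         # Equation (72): buffer first, new frame second
        y = []
        for i in range(n):
            acc = 0.0
            for k in range(2 * n):
                # Assumed kernel for Equation (71)
                angle = (2 * i + 1 + n) * (2 * k + 1) * math.pi / (4 * n)
                acc += x[k] * math.cos(angle)
            y.append(2.0 / n * acc)
        self.buf = list(decoded)             # Equation (73): keep this frame for the next call
        return y


transform = ResidualInverseTransform(4)
first = transform.process([0.5, -0.25, 0.0, 0.125])
second = transform.process([0.0, 0.0, 0.0, 0.0])   # still depends on the buffered frame
print(first, second)
```

Keeping the previous frame in the buffer means each output block depends on two consecutive sets of decoded coefficients, which is why the buffer must persist between calls.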

[0284] The present invention places no restrictions
on scalable coding layers, and can also be applied
to a case in which vector quantization using an auditory
masking characteristic value is performed in an upper
layer in a hierarchical voice coding and decoding method
with three or more layers.

[0285] In vector quantization section 1106,
quantization may be performed by applying acoustic
weighting filters to the distance calculations in
above-described Case 1 through Case 5.

[0286] In this embodiment, a CELP type voice coding and 
decoding method has been described as the voice coding 
and decoding method of the base layer coding section and 
decoding section by way of example, but another voice 
coding and decoding method may also be used. 

[0287] Also, in this embodiment, an example has been
given in which base layer coded information and
enhancement layer coded information are transmitted
separately, but a configuration may also be used in which
the coded information of each layer is multiplexed for
transmission, and demultiplexing is performed on the
receiving side to decode the coded information of each
layer.
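One way to picture the multiplexed configuration is a packet that carries both layers behind a small length header. The framing below is purely hypothetical and is not taken from the specification: it assumes each layer's coded information is already a byte string shorter than 65536 bytes and prefixes the two lengths as 16-bit big-endian integers.

```python
import struct


def multiplex(base_info: bytes, enh_info: bytes) -> bytes:
    """Pack both layers into one packet behind a 4-byte length header."""
    header = struct.pack(">HH", len(base_info), len(enh_info))
    return header + base_info + enh_info


def demultiplex(packet: bytes):
    """Split a packet produced by multiplex() back into the two layers."""
    base_len, enh_len = struct.unpack(">HH", packet[:4])
    base_info = packet[4:4 + base_len]
    enh_info = packet[4 + base_len:4 + base_len + enh_len]
    return base_info, enh_info


packet = multiplex(b"\x01\x02", b"\x03")
print(demultiplex(packet))
```

With explicit lengths in the header, a receiver that only needs the base layer can read it and ignore the remainder of the packet.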

[0288] Thus, in a scalable coding system as well, applying
vector quantization that uses an auditory masking
characteristic value according to the present invention makes it
possible to select a suitable code vector that minimizes
degradation of a signal that has a large auditory effect,
and to obtain a high-quality output signal.
[0289] (Embodiment 3)

FIG. 15 is a block diagram showing the configuration
of a voice signal transmitting apparatus and voice signal
receiving apparatus according to Embodiment 3 of the
present invention, containing the coding apparatus and
decoding apparatus described in Embodiments 1 and 2 above.
More specific applications include mobile phones, car
navigation systems, and the like.

[0290] In FIG. 15, input apparatus 1502 performs A/D
conversion of voice signal 1500 to a digital signal, and
outputs this digital signal to voice/musical tone coding
apparatus 1503.
Voice/musical tone coding apparatus 1503 is equipped with
voice/musical tone coding apparatus 101 shown in FIG. 1,
codes the digital signal output from input apparatus 1502,
and outputs coded information to RF modulation apparatus
1504. RF modulation apparatus 1504 converts the voice coded
information output from voice/musical tone coding
apparatus 1503 to a signal to be sent on a propagation medium
such as a radio wave, and outputs the resulting signal
to transmitting antenna 1505.
Transmitting antenna 1505 sends the output signal
from RF modulation apparatus 1504 as a radio wave (RF
signal). RF signal 1506 in the figure represents a radio
wave (RF signal) sent from transmitting antenna 1505.
This completes the description of the configuration and
operation of the voice signal transmitting apparatus.
[0291] RF signal 1507 is received by receiving antenna 
1508, and is output to RF demodulation apparatus 1509. 
RF signal 1507 in the figure represents a radio wave 
received by receiving antenna 1508, and as long as there 
is no signal attenuation or noise superimposition in the 
propagation path, is exactly the same as RF signal 1506. 
[0292] RF demodulation apparatus 1509 demodulates voice
coded information from the RF signal output from receiving
antenna 1508, and outputs the result to voice/musical
tone decoding apparatus 1510. Voice/musical tone
decoding apparatus 1510 is equipped with voice/musical
tone decoding apparatus 105 shown in FIG. 1, and decodes
a voice signal from the voice coded information output from
RF demodulation apparatus 1509. Output apparatus 1511
performs D/A conversion of the decoded digital voice
signal to an analog signal, converts the electrical signal
to vibrations of the air, and outputs sound waves audible
to the human ear.

[0293] Thus, a high-quality output signal can be obtained
in both a voice signal transmitting apparatus and a voice
signal receiving apparatus.

[0294] The present application is based on Japanese
Patent Application No. 2003-433160 filed on December 26,
2003, the entire content of which is expressly
incorporated herein by reference.



Industrial Applicability 

[0295] The present invention has the advantages of selecting
a suitable code vector that minimizes degradation of a
signal that has a large auditory effect, and of obtaining
a high-quality output signal, by applying vector
quantization that uses an auditory masking characteristic
value. Also, the present invention is applicable to the
fields of packet communication systems typified by
Internet communications, and mobile communication
systems such as mobile phones and car navigation systems.



